
ML-As-5

Q1

Given a set of 5 samples:

$$X = \begin{pmatrix} 0 & 0 & 1 & 5 & 5 \\ 2 & 0 & 0 & 0 & 2 \end{pmatrix}$$

where each column is one sample: (0, 2), (0, 0), (1, 0), (5, 0), (5, 2).

Use the k-means clustering algorithm to cluster the samples into 2 classes.

Choose the initial cluster centers: (0, 0) and (5, 0).

| Data Point | Distance to (0, 0) | Distance to (5, 0) | Cluster |
| --- | --- | --- | --- |
| (0, 2) | 2 | √29 | (0, 0) |
| (0, 0) | 0 | 5 | (0, 0) |
| (1, 0) | 1 | 4 | (0, 0) |
| (5, 0) | 5 | 0 | (5, 0) |
| (5, 2) | √29 | 2 | (5, 0) |
Update each cluster center to the mean of its assigned points: (1/3, 2/3) for the first cluster and (5, 1) for the second, then reassign:

| Data Point | Distance to (1/3, 2/3) | Distance to (5, 1) | Cluster |
| --- | --- | --- | --- |
| (0, 2) | √17/3 | √26 | (1/3, 2/3) |
| (0, 0) | √5/3 | √26 | (1/3, 2/3) |
| (1, 0) | 2√2/3 | √17 | (1/3, 2/3) |
| (5, 0) | 10√2/3 | 1 | (5, 1) |
| (5, 2) | 2√53/3 | 1 | (5, 1) |

The cluster assignments do not change after the second iteration, so the algorithm has converged. The final clusters are:

Class-1: (0, 0), (0, 2), (1, 0)
Class-2: (5, 0), (5, 2)
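As a check, the iteration above can be reproduced with a minimal NumPy sketch (the array layout and convergence test are my own choices, not part of the assignment):

```python
import numpy as np

# The five samples from Q1, one point per row.
X = np.array([[0, 2], [0, 0], [1, 0], [5, 0], [5, 2]], dtype=float)

# Initial cluster centers (0,0) and (5,0), as chosen above.
centers = np.array([[0, 0], [5, 0]], dtype=float)

for _ in range(10):
    # Assignment step: each point goes to its nearest center.
    dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    labels = dists.argmin(axis=1)
    # Update step: each center moves to the mean of its points.
    new_centers = np.array([X[labels == k].mean(axis=0) for k in range(2)])
    if np.allclose(new_centers, centers):  # assignments stable: converged
        break
    centers = new_centers

print(labels)   # [0 0 0 1 1] -> Class-1: first three points, Class-2: last two
print(centers)  # final centers (1/3, 2/3) and (5, 1)
```

The assignments stabilize after the second pass, matching the tables above.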

Q2

Suppose there are three coins, denoted A, B, and C. The probabilities of these coins coming up heads are π, p, and q, respectively. Conduct the following coin-toss experiment: first toss coin A and select coin B or coin C according to the result, choosing coin B for heads and coin C for tails; then toss the selected coin, recording the result as 1 for heads and 0 for tails. Repeat the experiment n times independently (here, n = 10). The observed results are as follows:

1,1,0,1,0,0,1,0,1,1

Suppose that only the result of each coin toss can be observed, not the tossing process itself. The task is to estimate the probability that all three coins come up heads, i.e., to find the maximum likelihood estimate of the model parameters θ = (π, p, q).

(Assume the initial values of the model parameters are $\pi^{(0)} = 0.46$, $p^{(0)} = 0.55$, $q^{(0)} = 0.67$; you may use Python to compute the results.)

The likelihood of observing the data is:

$$P(x_i \mid \theta) = \pi P_B(x_i) + (1 - \pi) P_C(x_i),$$

where $P_B(x_i) = p^{x_i}(1-p)^{1-x_i}$ and $P_C(x_i) = q^{x_i}(1-q)^{1-x_i}$, so the log-likelihood is

$$\ell(\theta) = \sum_{i=1}^{n} \ln\!\left(\pi p^{x_i}(1-p)^{1-x_i} + (1-\pi)\, q^{x_i}(1-q)^{1-x_i}\right).$$

E-step: compute the responsibilities

$$\gamma_{B,i} = \frac{\pi P_B(x_i)}{\pi P_B(x_i) + (1-\pi) P_C(x_i)}, \qquad \gamma_{C,i} = \frac{(1-\pi) P_C(x_i)}{\pi P_B(x_i) + (1-\pi) P_C(x_i)}.$$

M-step: update the parameters

$$\pi = \frac{1}{n}\sum_{i=1}^{n} \gamma_{B,i}, \qquad p = \frac{\sum_{i=1}^{n} \gamma_{B,i}\, x_i}{\sum_{i=1}^{n} \gamma_{B,i}}, \qquad q = \frac{\sum_{i=1}^{n} \gamma_{C,i}\, x_i}{\sum_{i=1}^{n} \gamma_{C,i}}.$$

Starting from $\pi^{(0)} = 0.46$, $p^{(0)} = 0.55$, $q^{(0)} = 0.67$, EM converges to

$$\pi \approx 0.4619, \quad p \approx 0.5346, \quad q \approx 0.6561,$$

and the probability that all three coins come up heads is

$$P(\text{3 heads}) = \pi \cdot p \cdot q \approx 0.4619 \times 0.5346 \times 0.6561 \approx 0.16.$$
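The EM iteration above can be sketched in a few lines of Python (variable names and the stopping tolerance are illustrative, not prescribed by the problem):

```python
import numpy as np

# Observed results of the 10 trials (1 = heads, 0 = tails).
x = np.array([1, 1, 0, 1, 0, 0, 1, 0, 1, 1], dtype=float)

# Initial parameter values given in the problem.
pi_, p, q = 0.46, 0.55, 0.67

for _ in range(100):
    # E-step: responsibility that each result came from coin B.
    pb = p ** x * (1 - p) ** (1 - x)        # P_B(x_i)
    pc = q ** x * (1 - q) ** (1 - x)        # P_C(x_i)
    gamma = pi_ * pb / (pi_ * pb + (1 - pi_) * pc)
    # M-step: re-estimate pi, p, q from the responsibilities.
    pi_new = gamma.mean()
    p_new = (gamma * x).sum() / gamma.sum()
    q_new = ((1 - gamma) * x).sum() / (1 - gamma).sum()
    delta = max(abs(pi_new - pi_), abs(p_new - p), abs(q_new - q))
    pi_, p, q = pi_new, p_new, q_new
    if delta < 1e-10:  # parameters no longer moving: converged
        break

print(round(pi_, 4), round(p, 4), round(q, 4))  # ~ 0.4619 0.5346 0.6561
print(round(pi_ * p * q, 2))                    # ~ 0.16
```

For this data the parameters reach a fixed point after essentially one iteration, since every observation is a single 0/1 toss.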

Q3

Given the observed data 67, 48, 6, 8, 14, 16, 23, 24, 28, 29, 41, 49, 56, 60, 75, estimate the parameters $(\alpha_0, \mu_0, \sigma_0, \alpha_1, \mu_1, \sigma_1)$ of a two-component Gaussian mixture model.

Initialization:

  • Randomly initialize the parameters $(\alpha_0, \mu_0, \sigma_0, \alpha_1, \mu_1, \sigma_1)$.
  • The weights $\alpha_0$ and $\alpha_1$ must satisfy $\alpha_0 + \alpha_1 = 1$.

E-step (Expectation step):

Compute the responsibility $r_{i,k}$ of each data point $x_i$ for the $k$-th Gaussian component:

$$r_{i,k} = \frac{\alpha_k\, \mathcal{N}(x_i \mid \mu_k, \sigma_k^2)}{\sum_{j=0}^{1} \alpha_j\, \mathcal{N}(x_i \mid \mu_j, \sigma_j^2)}$$

where $\mathcal{N}(x \mid \mu, \sigma^2)$ is the Gaussian probability density function.

M-Step (Maximization step):

Update the parameters using the responsibilities:

  • $\alpha_k = \frac{1}{n}\sum_{i=1}^{n} r_{i,k}$
  • $\mu_k = \dfrac{\sum_{i=1}^{n} r_{i,k}\, x_i}{\sum_{i=1}^{n} r_{i,k}}$
  • $\sigma_k^2 = \dfrac{\sum_{i=1}^{n} r_{i,k}\,(x_i - \mu_k)^2}{\sum_{i=1}^{n} r_{i,k}}$

Iterate:

  • Repeat the E-step and M-step until the parameters converge or the change is below a small threshold.
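The E-step/M-step loop above can be sketched as follows. The initialization here is one illustrative choice (equal weights, means at the data extremes, overall standard deviation for both components); since EM only finds a local optimum, the converged values depend on the starting point and may differ from the reported results.

```python
import numpy as np

# Observed data from Q3.
x = np.array([67, 48, 6, 8, 14, 16, 23, 24, 28, 29,
              41, 49, 56, 60, 75], dtype=float)

# Illustrative initialization (results depend on this choice).
alpha = np.array([0.5, 0.5])
mu = np.array([x.min(), x.max()])
sigma = np.array([x.std(), x.std()])

def normal_pdf(v, mu, sigma):
    """Gaussian probability density N(v | mu, sigma^2)."""
    return np.exp(-(v - mu) ** 2 / (2 * sigma ** 2)) / (sigma * np.sqrt(2 * np.pi))

for _ in range(500):
    # E-step: responsibilities r[i, k], one row per data point.
    dens = alpha * normal_pdf(x[:, None], mu, sigma)   # shape (n, 2)
    r = dens / dens.sum(axis=1, keepdims=True)
    # M-step: re-estimate weights, means, and standard deviations.
    nk = r.sum(axis=0)
    alpha_new = nk / len(x)
    mu_new = (r * x[:, None]).sum(axis=0) / nk
    sigma_new = np.sqrt((r * (x[:, None] - mu_new) ** 2).sum(axis=0) / nk)
    converged = np.allclose(np.r_[alpha_new, mu_new, sigma_new],
                            np.r_[alpha, mu, sigma], atol=1e-10)
    alpha, mu, sigma = alpha_new, mu_new, sigma_new
    if converged:
        break

print(alpha, mu, sigma)
```

Different initializations can converge to different local optima, which is why reported parameter values for this problem vary between runs.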

Component 0:

$\alpha_0 = 0.1332$, $\mu_0 = 57.51$, $\sigma_0 = 9.50$

Component 1:

$\alpha_1 = 0.8668$, $\mu_1 = 32.98$, $\sigma_1 = 20.72$